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We study theoretically the denaturation of single RNA molecules by mechanical stretching, focus- 
ing on signatures of the {un)folding pathway in molecular fluctuations. Our model describes the 
interactions between nucleotides by incorporating the experimentally determined free energy rules 
for RNA secondary structure, while exterior single stranded regions are modeled as freely jointed 
chains. For exemplary RNA sequences (hairpins and the Tetrahymena thermophila group I in- 
tron), we compute the quasi-equilibrium fluctuations in the end-to-end distance as the molecule is 
unfolded by pulling on opposite ends. Unlike the average quasi-equilibrium force-extension curves, 
these fluctuations reveal clear signatures from the unfolding of individual structural elements. We 
find that the resolution of these signatures depends on the spring constant of the force-measuring 
device, with an optimal value intermediate between very rigid and very soft. We compare and 
relate our results to recent experiments by Liphardt et al. [Science 292, 733-737 (2001)]. 



The advent and refinement of techniques to apply and 
measure forces on single molecules has allowed remark- 
able experiments probing the elastic properties and struc- 
tural transitions of bioinolecules such as DNA (see, e.g. . 



Bockelmann et alllToOSllMaier et aL. "2000': 'Smit h et all 
1996|) and proteins (see, e.g., Jlicf ct al., 1997.) . Re- 



cently, single-molecule experiments using optical tweez- 
ers were also performed to study the unfolding of small 
RNA molec ules under an applied force (Liphar dt et all 
l20nil 120021) . RNA molecules, besides functioning as 
messengers of sequence information in the process of 
protein synthesis, also have many biological functions 
that intricately depend on a precisely folded RNA struc- 
ture, e.g., a s rib osomal RNA or as self-splicing in- 
trons ijCechL ll99rJ) . The formation of RNA structure 
('RNA folding') is therefore an important biophysical 
process, which however is currently not unde rstood in 
sufficient deta il llTinoco and Bustainantel 1999() . Pulling 
experiments ijLiphardt et al .L 120011 I2002r possibly in 
combination with single-m olecule fluorescence methods 
ijZhuang et all l2000l l2002|) , promise to reveal important 
aspects of RNA structure, its folding pathways and ki- 
netics, and eventually its biological function. 

The experiment of iLiphardt et all 



200H) has shown 
that pulling on simple structural units of RNA, e.g., a 
single hairpin, yields characteristic features in the force- 
extension curve (FEC), which can be used to deduce the 
unfolding free energy and the size of the structural ele- 
ment. Moreover, the observation of end-to-end distance 
time-traces at constant force revealed that the unfold- 
ing and refolding of these simple structures proceeds di- 
rectly without intermediates, and the opening/closing 
rates could be extracted from the time-traces. From 
the theoretical side, it is interesting to ask how far, in 
principle, the pulling approach could be pushed to study 
larger RNA molecules and how the resolution of such 
approaches depends on parameters of the experimental 
setup. Here, we ex pand our previous m odel for RNA 
pulling experiments ijGerland et all l200lj) , and address 



these questions. The model incorporates the experimen- 
tally know n free energy r ules for RNA secondary struc- 
ture (, Walter et all Il994|) and a polymer model for the 
elastic properties of single-stranded RNA (ssRNA), but 
neglects pseudoknots and tertiary interactions, the ener- 
getics of which are currently poorly characterized. 

Our model yields predictions for force-extension mea- 
surements, including fluctuations, and the mechanical 
(un)folding pathway for any given RNA se quence. Below, 
we be gin with the P5ab hairpin used bv lLiphardt et all 
l)200lD and demonstrate that our model yields a FEC 
which is in semi-quantitative agreement with the exper- 
imental curve. This agreement gives us confldence that 
our model is sufficiently realistic to permit its use to 
explore general questions regarding sequence-dependent 
signatures in mechanical single-molecule experiments on 
RNA. 

We first address the question of intermediates in the 
unfolding pathway and show that, according to our 
model, a small modification in the sequence of the P5ab 
hairpin can change its two-state folding behavior and 
introduce a locally dominant intermediate. However, 
whether this intermediate state can be observed through 
quasi-equilibrium fluctuation measurements critically de- 
pends on the experimental conditions: If the force- 
measuring device is a "soft spring" , the hairpin unfolds 
without any visible intermediates. Only when the force 
is measured with a stiff spring, an intermediate state can 
be observed. 

We then consider a larger RNA molecule, the group I 
intron of Tetrahymena thermophila with a sequence of 
about 400 bases and a known seco ndary structure con- 
taining many individual elements l)Cechl Il993|) . Previ- 
ous theoretical work predicted that equilibrium FEC's of 
large RNA's are smooth and display no secondary struc- 
ture dependent features, due to a compensation effect 
between individual structural elements ( Gerland ct alj, 
I2001D . Here, we find that even in the absence of 
structure-based features in the FEC, measurements of 
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the equihbrium fluctuations of the entire molecule can 
still be useful to obtain information on the (un)folding 
pathway. Also, for this longer molecule a good choice of 
the stiffness of the force-measuring device is important 
for the determination of the pathway. 

Finally, we compare the mechanical unfolding process 
studied here with the more conventional thermal unfold- 
ing. Again using the group I intron as an example, we 
find that the individual structural elements display signif- 
icantly sharper opening transitions for force- induced than 
for thermal denaturation wi thin our model . Ind eed, in 
UV absorption experiments l)Baneriee et al.lfl993|) . ther- 
mal melting of the intron shows only one broad signature 
associated with the opening of the secondary structure. 



Model 

We consider an experimental setup where the two 
ends of an RNA molecule are attached to a force- and 
extension-measuring device, e.g. an atomic force micro- 
scope or beads trapped by optical tweezers. For the sake 
of concreteness, we use the optical tweezer setup sketched 
in Fig.^a) as an example here. In an idealized theoreti- 
cal model, the RNA molecule (in essence, a highly nonlin- 
ear elastic element for the present context) is connected 
in series with a linear harmonic spring, and the total ex- 
tension is externally controlled, see Fig.QJb). The linear 
spring models the force-measuring device. E.g., for op- 
tical tweezers, its harmonic potential approximates the 
potential of the optical trap, and the spring constant A 
is adjustable in the experiment through the laser inten- 
sity. The extension of the spring, Rg , corresponds to the 
position of the bead with respect to the minimum of the 
trapping potential, and is measured as a function of the 
total extension Rt . The average force acting on the RNA 
molecule and its average extension are 
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Here, (. . .) denotes a thermal average over all accessi- 
ble conformations of the RNA molecule and the spring 
at fixed total extension Rt. Throughout this article, we 
assume that the pulling experiment is performed slowly 
enough, such that the RNA molecule can fully sample its 
conformational space on timescales which are short com- 
pared to the external timescale of the imposed stretching 
process. This limit corresponds to the quasi-equilibrium 
regime, where the work dissipated in the pulling process 
is negligible. The average (. . .) then includes an average 
over all possible secondary structures of the molecule. 

Partition function. To calculate the averages in 
Eqs. and ||2Jl, as well as the averages of other ob- 
servables to be introduced below, we first determine the 
partition function, Z{Rt), of the RNA molecule and the 
spring at fixed total extension Rf. In previous work 
ijGerland et al.ll200l[l . we computed the partition func- 
tion of the RNA molecule alone in the fixed distance 




FIG. 1 Sketch of the system considered here, (a) In a typical 
experimental setup, the two ends of an RNA molecule are 
attached to beads the position of which is controlled, e.g., 
with optical tweezers, (b) In a theoretical model, the potential 
of the optical trap can be approximated by a linear spring 
(with spring constant A). This spring is connected in series 
with a highly nonlinear elastic element (the RNA molecule, 
usually with an additional linker). The total extension, Rt, is 
externally controlled, while the individual extensions, Ra and 
R, undergo thermal fluctuations. 



ensemble. Here, we explicitly take the force-measuring 
spring into account; varying the spring constant A in- 
terpolates between the fixed distance ensemble for stiff 
springs (large A) and the fixed force ensemble for soft 
springs (small A). Intermediate spring constants effec- 
tively put the RNA molecule into a mixed ensemble. The 
explicit consideration of the spring in the model is nec- 
essary here, not only for a closer modeling of the actual 
experimental setup, but also from a theoretical point of 
view, since the force fluctuations diverge in the fixed dis- 
tance ensemble and our aim is to study fluctuation mea- 
surements. 

A model for an experimental setup such as the one 
sketched in Fig. ^a) requires two separate parts, one 
that contains only information on the free energies for 
the secondary structures of the RNA sequence, and an- 
other which de s cribes the polymer properties of ssRNA 
llGerland et all 1200 iD . The coupling between these two 
parts is through the total length of the exterior ssRNA 
segments of the molecule, i.e. the number m of bases that 
actually "feel" the applied force, see Fig.Q^a). For every 
fixed total extension Rt, the system has to find a com- 
promise between lowering the RNA binding energy by 
decreasing m, and gaining entropy (plus lowering elastic 
energy) by increasing m. The secondary structure part is 
given by the partition function, Q(m), summed over all 
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secondary structures of the RNA molecule with a fixed 
number m of exterior single-stranded bases^. The poly- 
mer properties of ssRNA then enter through a function 
Wtot {Ru rn), which denotes the total end-to-end distance 
distribution of a ssRNA molecule of m bases in series 
with the spring^. The total partition function, Z{Rt) is 
then simply the convolution of Q(m) with Wtot{Rum), 



(3) 



The detailed calc ulation of Q(m) is described in 
iGerland et alJ l)200l|) . Briefly, the calculation takes the 
experimentally determined rules for th e binding free en- 
ergies of RNA secondary structures (;Walter et all Il994l) 
into account, but neglects tertiary structure effects and 
pseudoknots. Our method is based on a recursive 
meth od to calculate the partition function (McCaskill 
ll99Cll) as implemente d in the 'Vienna RNA package' 
l|Hofacker etallllQg^ . but introduces an additional re- 
cursion relation for the calculation of Q{m). 

The total end-to-end distance distribution of a ssRNA 
molecule of m bases in series with the spring can be ex- 
pressed as 



Wtot (i?t;m)= / di? WRNA(i?;TO) 



, (4) 



where Wr,na(^;w) denotes the distribution of the ss- 
RNA molecule alone. We model the ssRNA as an elas- 
tic freely jointed chain, i.e., as a chain of segments with 
unrestricted relative orientations, whereas the segment 
length r is constrained by a harmonic potential V{r) = 
K(r — 6)^/2. The average segment length h corresponds to 
the Kuhn length of a non-interacting ssRNA chain. The 
number of segments of the elastic freely jointed chain 
used to represent an RNA molecule with m bases is cho- 
sen as ml/h where / is the base to base distance of ssRNA. 

This yields WRNA(i?;™) ~ e"''-", where 

C is a normalization constant, q{h) ~ (e~^^ ''), and h 
is de termined from R — ^ ■^\ogq(h) l)Gerland et all 
120011) . Such a model has been shown to describe 
the elastic properties of ssDNA molecules (jMaier et all 
[2000; Montanari and Mczard, 2001) . Since we are not 
aware of the corresponding data for RNA, we use the 
DNA val ues for the polymer parameters as obtained by 
iMontana ri and Mez ard ( 200JJ through fitting to the ex- 
periment of'Maie^t.alJ (|2000j), i.e. / = 0.7nm, 6=1.9nm, 
and (k/ZcbT)"^/^ = O.lnm (we do not expect a large 
difference in the single strand properties of DNA and 
RNA, because of the high similarity between their chem- 
ical structures). We take the free energy parameters for 



RNA secondary structure as supplied with the Vienna 
package (version 1.3.1) at room temperature T = 25°C. 
The salt concentrations at which the free energy param- 
eters were measured are [Na"*"] = IM and [Mg^"*"] — OM. 

Observables. From the partition function (O we get 
the total free energy F{Rt) = ~kBT\ogZ{Rt), the av- 
erage force (/) = dF{Rt) I dRt-, and through Eqs. (P) 
and l|2Jl also the average extension {R) of the RNA 
molecule. The equilibrium fluctuations in the extension 
of the RNA molecule are given by {SR)"^ = {R^) - = 
[A — d'^F/ dR^]^/ fiX^ . Using these relations, one can 
show^ that the variance is related to the derivative of the 
force-extension curve as measured in the mixed ensemble 
through 



X + d{f)/d{R) 



(5) 



This exact relation simply expresses the intuitive result 
that the extension of the molecule undergoes large ther- 
mal fluctuations in the flat regions of the force-extension 
curve. 

To study the unfolding pathway, we introduce the 
weight Xm{Rt) for the secondary structures with m exte- 
rior open bases at a given value of Rt, 



Xm{Rt) = 



Q{m)WtotiRt;m) 
Z{Rt) 



(6) 



which satisfy the normalization condition "^^Xm — 1. 
The binding probability of base i and j at given Rt may 
then be expressed as 



P^J{Rt 



XrniRt)Pi]im) , 



(7) 



where pij (m) denotes the base pairing probability in the 
ensemble of structures with fi xed m, which can be c alcu- 
lated as described previously l|Gerland et alll200l|) . 



Results 

We first consider the P5ab hairpin, which is the 
simplest RNA molec ule studied in the experiment of 
iLiphardt et al and is of a size that typically oc- 

curs as a structural subunit in larger RNA's. Fig. ^a.) 
shows its native structure and the experimental FEC 
for P5ab with added linker, taken with permission from 
Liphardt et al. (200^. The superimposed solid line 
shows the FEC that results from our model as described 



We account for the width of stems within the exterior single 
strand by increasing the base count m by three for each stem. 
^ For simphcity, we restrict the spring to be coUinear with the 
end-to-end distance of the RNA molecule. 



^ The derivation is straightforward after observing that 
d(R) dRtl dRt 1 - (1/A)92 F(iJt)/aiJ? ■ 
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above^ with a spring constant of A = 0.2 pN/nm. Here, 
we modeled the RNA-DNA hybrid linker, which in the 
experiment connects the RNA molecule to the beads, as 
a worm-like chain (WLC) with an experimentally known 
length oi L ~ 320n m. We used the worm-lik e chain in- 
terpolation formula l|Biistamante et iill Il994l) 



/wLc(-R) 



4(l-i?/L)2 



R 
L 



(8) 



where the value for the persistence length, Ip — 3.57nm 
and the (experimentally unknown) origin of the distance 
scale were obtained by fitting to the experimental FEC 
at low forces. The shape of the experimental FEC is well 
captured by the theoretical curve, with a characteristic 
"hump" indicating the opening of the hairpin (the force 
drop in the hump occurs, because the released single- 
strand creates "slack" in the polymer). However, the 
force at which the hairpin opens is overestimated by the 
theoretical model. To assess the origin of this discrep- 
ancy, we applied a simple salt correction to the RNA 
binding (free) energies (SantaLucia, 1998) to account for 
the different ionic conditions between the single-molecule 
experiment ([Na+] = 0.25M) and the standard condi- 
tions ([Na+] = l.OM). The resulting FEC (dashed hue 
m Fig. Ei) yields near-quantitative agreement with the 
experiment, indicating that the difference in ionic con- 
ditions accounts for most of the discrepancy. The re- 
maining discrepancy could be caused, for instance, by 
the freely jointed chain model for ssRNA or the unknown 
stacking energies for G-A basepairs. However, the good 
agreement we have achieved seems sufficient to justify the 
use of our model as a tool to study the general questions 
outlined in the introduction (in the following, we keep 
the salt concentration at the standard conditions). 

The EEC's in Fig. [21 contain useful information on the 
RNA molecule. For instance, the total binding free en- 
ergy and the size of the hairpin can be read off by com- 
paring to the FEC for a non-binding control sequence. 
Within our theoretical model, such control curves are 
obtained through a convolution of the WLC model © 
for the linker with the freely-jointed chain model for the 
single-stranded RNA. The resulting curves could also be 
used as a reference for the experimental data, although 
ideally the control curve should be measured, using e.g. 
an oligonucleotide with a homogeneous sequence (note 
that it suffices to record the control for a single sequence 
length, since it can be straightforwardly rescaled to an 
arbitrary length). Fig. [S^b) shows our theoretical FEC 
for P5ab both for a stiff spring (A = 0.2 pN/nm, solid 
line) and a soft spring (A = 0.01 pN/nm, dashed line). 



* The native structure of P5ab contains two non-standard G-A 
base pairs for which no quantitative information on the bind- 
ing e nergies is available. W e therefore follow the suggestion of 
Ref. iLiohardt et al.Ll2001^ and approximate the binding energy 
of G-A basepairs by replacing them with G— U pairs. 
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FIG. 2 Force-extension curve (FEC) for the P5ab hairpin 
with added linker, (a) Native struct ure of the hair pin, its 
FEC as experimentally determined bv lLiohardt et all |2001), 
and the FEC resulting from our model for two different salt 
concentrations, (b) FEC for P5ab as results from our model 
for two different spring constants A (solid and dashed line; 
see main text for details). The dotted line represents the 
FEC for a non-binding sequence of the same length as P5ab. 
Inset: End-to-end distance fluctuations as a function of the 
average extension. 



together with the control FEC (dotted) for a sequence 
of commensurate length. Since stretching the control se- 
quence requires only work against entropic and elastic 
forces, while stretching of the P5ab hairpin requires ad- 
ditional work to break the basepairs, the area between 
the two FEC's equals the total binding free energy of 
P5ab. As Fig. El^b) shows, not only the region of the 
P5ab FEC where the basepairs are opened (the "hump" 
region) contributes to this area, but there is also a signif- 
icant contribution from the initial part of the FEC. The 
initial part of the P5ab FEC corresponds to stretching 
the exterior single-stranded chain of the molecule, which 
is shorter than the full molecule and hence has higher 
entropic/elastic forces at the same extension. However, 
this additional entropic/elastic energy is "stored" in the 
molecule and is fully released when the hairpin is (adi- 
abatically) unzipped, i.e. in the "hump" region. Note 
that as long as the experiment is performed in quasi- 
equilibrium, the area under the FEC is independent of 
the spring constant, and therefore the same in the fixed- 
force and the fixed-distance ensemble. Loosely speaking, 
the FEC in the fixed-force ensemble could therefore be 
considered as the 'Maxwell construction' of the FEC in 
the fixed-distance ensemble. 

We now seek to characterize the unfolding process of 
the P5ab hairp i n with in our model. In the experiment of 
iLiphar dt et alJ (|200lfl . the time traces of the end-to-end 
extension at fixed forces around 14 pN show the char- 
acteristic pattern of a bistable system, i.e. the unfold- 
ing proceeds directly, without intermediates. To cal- 
culate theoretical time traces would require a full ki- 
netic treatment of the RNA folding/unfolding process 
which is beyond the scope of the present article. In- 
stead, we determine the amplitude of the equilibrium 
fiuctuations in the end-to-end extension, 5R, as a func- 
tion of the mean force or extension. Experimentally, 
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FIG. 3 Calculated force-extension characteristics for the 
modified P5ab hairpin (G-A basepairs replaced by A-A) for a 
hard (A — 0.2 pN/nm) and a soft (A = 0.01 pN/nm) harmonic 
potential of the optical tweezer, (a) The average FEC, which 
resembles very much the FEC in Fig. |2Ib). (b) The fluctua- 
tions of the extension as a function of the average extension. 
While the curve is very similar to the inset of Fig. |2Ib) for 
a soft spring, it develops a pronounced minimum correspond- 
ing to an intermediate state for a hard spring. The arrows 
indicate the positions at which the a;m-distributions shown in 
Fig. 2] were calculated. 



this amplitude would be determined as a time-average, 
SR^ = T-^ f^dt{R{t) -Rf, where R{t) denotes the 
recorded time trace of the end-to-end extension and R 
its average. Our theoretical prediction is for the infinite 
time limit, T — > oo, of this average. Note that the exact 
relation Eq. directly links the equilibrium fluctua- 
tions to the derivative of the averaged FEC. In practice, 
where the averaging period T is finite and limited to a 
few minutes by instrumental drift fLipha rdt et alJ . f200l[l . 
the amplitude 5R can be determined with much greater 
accuracy than the derivative of the time-averaged FEC, 
since taking the derivative strongly amplifies the statisti- 
cal error. Therefore, the quasi-equilibrium FEC and the 
equilibrium fluctuations, 5R, can effectively be regarded 
as two independent and complementary sources of infor- 
mation. 

The inset of Fig. |2Ib) shows the fluctuations 5R as 
a function of the average extension, (i?), in the range 
of extensions over which the hairpin opens. We observe 
a single peak of the fluctuations at the force/extension 
where the transition takes place, which is consistent with 
the two-state behavior found experimentally. Physically, 
this peak is caused by continual kinetic fluctuations be- 
tween the open and the closed state of the hairpin. 

Hairpin vi^ith intermediate. Next, we address the 
question whether the simple two-state behavior observed 
for P5ab is a generic property for hairpins of this typi- 
cal size. The discussion of this question also serves as 
a preparation for our ensuing study of a larger RNA 
with more complicated structure. To this end, we mod- 
ify the sequence of the P5ab hairpin slightly by replacing 
the two G-A basepairs with U's on both strands, which 
leads to a small interior loop in the hairpin. This change 
does not significantly affect the FEC, see Fig.|3Ia), how- 
ever the fluctuations 5R take on a qualitatively different 
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FIG. 4 Probability distributions in m-space for the modified 
P5ab hairpin. The left column is calculated with a soft spring 
(A = 0.01 pN/nm), and the right column with a stiff spring 
(A = 0.2 pN/nm). The curves in the top/middle/bottom row 
are calculated for different average extensions {R) correspond- 
ing to the positions indicated by the arrows in Fig. Ob) [first 
peak, minimum, and second peak in the fluctuation curve, 
respectively] . For the stiff spring, the distribution at the min- 
imum is almost entirely localized on the intermediate state. 
In contrast, the distribution is spread out over all three states 
for the soft spring at the same average extension. 



behavior, as shown in Fig. |3Jb): For a soft spring, the 
(5i?-curve shows only a single peak as before, but a stiff 
spring yields two maxima with a pronounced minimum 
in between. At this minimum, the configurational dis- 
tribution of the molecule is localized on an intermediate 
state, which strongly reduces the fluctuations in the ex- 
tension. This is demonstrated explicitly in Fig. 01 which 
shows the probability distribution in m-space, i.e. Xm as 
given by Eq. © (see caption for details). 

The stiffness of the external spring therefore limits the 
attainable resolution of the measurements: At low reso- 
lution the hairpin appears as a two-state folder, while an 
intermediate state appears at higher res olution. This ob- 
servat ion is in line with the comment of iFernandez et al 



2001) who discuss the experiment of lLiphardt'er^ 



( 20011) by analogy to patch clamp experiments, where 
an intermediate state in the opening of an ion channel 
appears only at high resolution. While an increase in the 
spring constant increases the resolution, it decreases the 
amplitude of the fluctuations, which renders them harder 
to detect. For the present purpose, the optimal value for 
A results from a tradeoff between low resolution and low 
amplitude. Clearly, increasing A can increase the resolu- 
tion only up to the point where the Soppiness of the RNA 
plus linker becomes resolution limiting (this point corre- 



t 

closed 



m 



t 

intermediate 



open 



FIG. 5 Particle with an attached spring in the free energy 
landscape of the modified P5ab hairpin. The other end posi- 
tion of the spring is externally controlled (see lGerland et all 
l200ll . for the details of the mapping to the RNA unzipping 
problem). The spring in this picture represents the measure- 
ment device, the linker, and the ssRNA and has an effective 
spring constant A. Its harmonic potential for A = O.OI pN/nm 
and A = 0.2 pN/nm is shown as the dashed and dash-dotted 
lines, respectively. With the soft spring, the particle can jump 
from the closed to the open state immediately as soon as the 
force is strong enough to overcome the energy barrier. The 
hard spring has a steeper harmonic potential and thus the 
particle will first jump into the intermediate state and later 
into the open state as the end of the spring is moved towards 
larger m. 



sponds approximately to A = 0.5 pN/nm for the hair- 
pin sequences with linker discussed above). This point 
marks the optimal choice for A, unless the spacial resolu- 
tion of the experimental apparatus is restrictive already 
at smaller values of A. The role of the spring constant 
in dynamic force spectroscopy measu rements is discussed 
bv lHevmann and GrubmiiUen l)2000l) . 

Intuitively, the fact that the appearance of the interme- 
diate state depends on the spring constant is best under- 
stood by mapping the unzipping problem onto the prob- 
lem of a particle with an attached spring in a random po- 
tential and coupled to a thermal bath (see Fig.[SJ. As the 
other end of the spring is moved in on e direction, the par- 
ticle performs a 'stick-slip' motion ijBockelmann et all 
Il998|) . With a stiff spring, the particle hops over short 
distances and tends to be localized, while the particle 
can make long jumps, sliding over many valleys, when 
the spring is soft. 

RNA molecule with multiple structural sub- 
units. Longer RNA molecules typically have 
a complex structure with many structural sub- 
units interacting t hrough w eak tertiary contacts 
l)Tinoco and Bustamantel Il999). A typical example for 
a structural subunit is the P5ab hairpin studied above. 



which is indeed extracted from a larger RNA molecule, 
the self-splicing intron of Tetrahymena thermophila with 
a sequence of 419 bases (Genbank # J01235). The known 
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FIG. 6 Secondary structure of the Tetrahymena thermophila 
group I intron as obtained from the Vienna package. The 
overall structure, and almost all individual basepairs, are 
identical to the one of the inactive conformation as deter- 
mined by iPan and Woodsonl il99^ (we follow the labeling 
introduced there). 



secondary structure of this intron comprises 19 structural 
elements labeled PI, P2, P2.1, P3, etc. It was the first 
RNA molecule to be shown to have a cata lytic activit y 
and has since been studied in great detail 
Its active, i.e. self-splicing, conformation contains a pseu- 
doknot, which is essential for the function. However, it 
also has a known stable and well ch aracterized inactive 
conformation without pse udoknot jPan and Woodsonl 
Il998t IZhuang et all l200 (?). The minimum free energy 
structure obtained from the Vienna RNA Package (which 
supplies the basis to our model) is shown in Fig. |S1 The 
overall structure, and almost all individual basepairs, are 
identical to the one of the inactive conformation as deter- 
mined bv lPan and Woodsonl ljl998l|) (we follow the label- 
ing introduced there) , which confirms that the RNA free 
energy rules describe the nucleotide interactions quite 
well. For the cyclized form of this i ntron, a very similar 
secondary structure was obtained bv l.Iaeger et"al] fl99m . 

For complex secondary structures such as the one 
shown in Fig. it is interesting to study the physical 
process of structure formation, whereby e.g. a newly 
produced RNA molecule folds into its preferred struc- 
ture(s). For instance, one may ask, in which order the 
structural elements in Fig.j^lform, i.e. what is the folding 
pathway? Mechanical single-molecule experiments offer a 
very controlled way to probe the folding process. In the 
quasi-equilibrium limit that we consider, the mechani- 
cal folding pathway is independent of (i) the direction, 
i.e. unfolding and refolding pathway are identical, and 
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(ii) specific experimental parameters such as the spring 
constant A. The latter is clear from the fact that the 
number of exterior unpaired bases, m, is a natural and 
well-defined reaction coordinate in the quasi-equilibrium 
limit. On average, the distribution Xm will always shift 
to larger m when Rt is increased, and hence A can only 
affect the resolution of individual states in m-space, but 
not the average order in which they are visited. 

How can one determine the (un)folding pathway in a 
single-molecule experiment? As we have seen, pulling on 
an isolated subunit produces a characteristic signature 
in the FEC, see Fig. [3 If every structural subunit in a 
larger RNA molecule would produce such a characteristic 
signature, one could directly read off the folding pathway 
from the FEC. However, our previous theoretical study 
predicted that the quasi-equilibrium FEC of RNA's with 
many subunits will typically show no dist inguishable sig- 
natures of individual subunits opening l)Gerland et all 
1200 li'l . This prediction is consistent with the experi- 
mental ob servation of smoo th EEC's for long ssDNA 
molecules (|Maier et al.L l2000^ . Three physical effects are 
responsible for the smoothness: (i) the 'Soppiness' of the 
external single-strand and the linkers, (ii) thermal fluc- 
tuations in the secondary structure, i.e. the contribution 
of suboptimal structures, and most importantly, (iii) the 
fact that changes in the extension of individual structural 
subunits can compensate each oth er, since only the tota l 
end-to-end distance is measured llGerland et all 120011) . 
As a result, quasi-equilibrium EEC's cannot be used to 
study the folding pathway (or the secondary structure) 
of larger RNA molecules. 

The smoothing mechanisms (ii) and (iii) can be 
strongly suppressed by performing the experiments at 
large pulling speeds which do not leave sufficient time 
for the molecule to sample the entire ensemble of dif- 
ferent secondary structures with similar number of ex- 
terior unpaired bases, m, and free energy of folding. 
Hence, nonequilibrium EEC's of larger RNA molecules 
should be very rugged and display signatures of individ- 
ua l structural elements opening; see also the discussion 
in iGerland' et all II2001D . This was indeed observed re- 
cently in the Berkeley group (S. Dumont & I. Tinoco Jr., 
private communication). From the theoretical side, the 
calculation of nonequilibrium EEC's is challenging, since 
the simple particle-in-a-landscape picture of Fig.[Slbreaks 
down in nonequilibrium: the free energy landscape is no 
longer well-defined. The secondary structures that are 
accessible for a molecule at a given extension and within 
a given time window depend on the present structure 
of the molecule and the detailed folding kinetics. While 
important steps towa rds a kinetic theory of RN A fold- 
ing have been taken l|lsambert and Siggial l200d(l . there 
is currently insufficient experimental information avail- 
able to construct a full theoretical model. 

Here, we consider instead the equilibrium fluctuations 
in the end-to-end distance of the entire intron, and ex- 
plore, within our theoretical model, how much informa- 
tion on the folding pathway can be obtained from this. 



While looking at the equilibrium fluctuations does not 
suppress any of the above-mentioned smoothing mecha- 
nisms, it does provide a greater resolution for the obser- 
vation of structural transitions, as already discussed. We 
will see in the following that this enhanced resolution, 
which is dependent on the spring constant, is sufficient 
to reveal major steps of the unfolding process. 

The quasi-equilibrium FEC of the intron is shown in 
Fig. [Tfa). Here, we have not added an additional linker 
in the calculation, since the intron is already fairly large 
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FIG. 7 Calculated force-extension characteristics of the 
Tetrahymena thermophila group I intron. (a) FEC. (b) Fluc- 
tuations in the extension of the ribozyme for different stiffness 
A of the force measuring device, (c) Fractional opening for 
all of the structural elements at A = 0.02 pN/nm. A black 
square indicates that the structural element is completely in- 
tact while a white square indicates that all base pairs of the 
element have been opened. It can be seen that the FEC lacks 
all structural features. In contrast, the fluctuations display 
pronounced peaks that coincide with the disappearance of 
specific structural elements. 



8 



and flexible, so that the effect of a comparatively short 
and stiff double-stranded linker is negligible. As ex- 
pected, the FEC displays no signatures of the secondary 
structure, mostly due to 'compensation' between differ- 
ent structural elements (^Gerland et al.i . 120011) . As for the 
P5ab hairpin above, we compute the equilibrium fluctu- 
ations SR, which would correspond to the average stan- 
dard deviation in an inflnitely long experimental time 
trace R{t). Obviously, our averaged SR contains less in- 
formation than experimental R{t) traces. Nevertheless, 
as Fig. [7fb) shows, already the equilibrium fluctuations 
SR display interesting sequence-dependent features, i.e. 
local maxima and minima. The minima can again be 
interpreted as intermediate states along the (un)folding 
pathway, and the maxima as structural transitions. Note 
that the central peak close to (i?) = 150 nm splits into 
two peaks as the stiffness of the spring is increased, in- 
dicating the appearance of a new intermediate on the 
(un)folding pathway. This illustrates again how the abil- 
ity to resolve intermediate states depends on the stiffness 
of the force-measuring device. For the choice of the latter, 
a trade-off between resolution of intermediate structures 
and fluctuation amplitude has to be respected. 

Within our theoretical model it is a simple matter to 
identify the structural transitions associated with the 
peaks in Fig. [7{b). To obtain a measure for the likeli- 
hood to flnd a given structural element at a given mean 
extension of the molecule, we have summed the basepair 
probabilities for all basepairs in the element and divided 
by the total number of basepairs in the intact element. 
Here, the basepair probabilities at a given extension are 
computed using Eqs. (|S1|7I). For each of the labeled struc- 
tural elements in Fig. |3 we plotted the normalized total 
basepair probabilities as a function of the mean exten- 
sion at A = 0.02 pN/nm in Fig. [7{c) using a greyscale 
code (black corresponds to presence of the element with 
probability one and white to absence of the element). We 
observe that the opening of most structural elements is 
localized to a relatively small extension interval, however 
typically the opening of several elements takes place si- 
multaneously. Roughly, the structural elements open in 
the following order: P7, Alt P3, P4, P5, P5a, PI, P2.1, 
P6a, P9.1, P9.1a, P9.2, P2, P9, P8, P6b, P5c, and P5b. 

By simultaneous inspection of Fig. [7fb) and (c), we 
can assign particular peaks in the fluctuation curve to 
the opening of particular elements. For instance, the 
central peak is associated with the parallel opening of 
P9.1, P9.1a, and P9.2, whereas the most stable elements 
P5b, P5c open at the final "hump" . Generally, elements 
that free a lot of single strand upon opening, such as a 
stem-loop structure with a large loop (e.g. P2.1), are 
associated with a distinct peak in the fluctuation curve. 
This peak reflects the large difference in extension be- 
tween the open and closed state of the element. 

To obtain the information displayed in Fig.[7{c) in the 
experiment will be more laborious, but should be possi- 
ble given the known secondary structure. Since at every 
mean extension (or force) only a small number (up to 
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FIG. 8 Thermal unfolding of the intron. (a) Specific heat 
curve, (b) Fractional opening for all of the structural ele- 
ments. Analogously to Fig. [7Jc) a black square represents a 
fully formed structure whereas a white square represents a 
structure that has fully opened up. 



3) structural elements open in parallel, one can expect a 
small number of characteristic plateaus in the experimen- 
tal R{t) traces. The identification of the plateaus with 
the corresponding elements can be obtained by cutting 
the molecule at several places between the known sec- 
ondary structure elements and performing the same mea- 
surement on subsequences. A similar approach was taken 
for recent nonequilibrium experiments in the Berkeley 
group (S. Dumont & L Tinoco Jr., private communica- 
tion). Obtaining the mechanical (un)folding pathway for 
a moderately sized RNA with known secondary structure 
therefore appears feasible by measuring the fluctuations. 

It is interesting to compare the mechanical unfolding 
studied so far with thermal unfolding. The thermal un- 
folding process of the group I intron can be studied theo- 
retically with the Vienna RNA package ijHof acker et al.l 
[1994) . Fig. IHl^b) shows the fractional opening of all struc- 
tural elements as in Fig. Oc), but plotted against tem- 
perature. We observe that the individual stems show 
a less sharp transition from closed to open than in 
Fig. [Tfc). As a result, the specific heat curve shown in 
Fig. IHIa) displays only a single broad peak for the un- 
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folding of the secondary structure (the specific heat is 
obtained from the total folding free energy, AG, through 
-Td^iAO/dT"^) This is consi s tent w ith UV absorp- 
tion experiments (iBaneriee et alJ.ll993|) that found only 
two broad peaks for the same RNA, where the low- 
temperature peak could be associated with melting of the 
tertiary structure and the high-temperature peak with 
the secondary structure. 

From Fig. |SIb), a rough order for thermal opening of 
the structural elements can be discerned: P4, P5, P5a, 
P6a, P7, Alt P3, PI, P2.1, P9, P8, P9.1a, P9.2, P9.1, 
P6b, P5b, P2, P5c. No structural element in this ther- 
mal (un)folding pathway is more than four list positions 
away from its place in the mechanical (un)folding path- 
way discussed above. This suggests that the thermal and 
mechanical unfolding pathways, although different in de- 
tail, can still be used as reasonable priors for each other. 



Summary and Discussion 

Single-molecule experiments are particularly useful 
when the molecule under study can take on several con- 
formations, e.g. folding intermediates, because in these 
cases bulk measurements usually yield only average prop- 
erties, while single-molecule measurements can charac- 
terize each of the different conformations. Here, we pre- 
dicted experimental signatures of such folding interme- 
diates in the quasi-equilibrium fluctuations of the end- 
to-end distance. We found that a local minimum in the 
fluctuations curve indicates a locally stable intermedi- 
ate state in the average mechanical unfolding pathway 
(which is well-defined and identical to the (re)folding 
pathway). The number of local minima in the fluctu- 
ations curve can therefore serve as a lower estimate of 
the number of locally stable intermediate states in this 
pathway. Furthermore, these signatures could be used to 
reconstruct the (un)folding pathway of an RNA molecule 
with a known secondary structure by cutting the se- 
quence at appropriate positions and individually probing 
different substructures of the molecule. 

We also showed that the stiffness of the force- 
measuring device plays a crucial role in determining the 
resolution of the quasi-equilibrium fluctuations curve. If 
the device is too soft, some local minima can be lost, 
while a too rigid device has a low fluctuation ampli- 
tude and therefore a low signal-to-noise ratio. The op- 
timal choice for the stiffness (see 'Results') leads to the 
best estimate for the number of locally stable interme- 
diate states along the pathway and increases the degree 
to which the pathway could be reconstructed with the 
"cutting approach" . The dependence on the stiffness has 
a simple intuitive explanation within the statistical me- 
chanics problem of a particle with an attached spring 
in a one-dimensional energy landscape (see Fig. A 
stiff force-measuring device corresponds to a rigid spring, 
which tends to localize the particle in valleys of the land- 
scape (i.e., locally stable intermediates) and forces it to 



make small jumps as it is pulled along the landscape. 
On the other hand, a soft spring allows the particle to 
spread out and make long jumps, effectively hiding the 
intermediates in the equilibrium curve. 

We hope that this view, which also explains the 
smoothness of quasi-e quilibrium FEC's for large RNA's 
llGerland et al.Ll2001t) . will be useful for the design and 
interpretation of future experiments. Of course, experi- 
ments typically record not only the average fluctuations, 
but the d etailed time -traces of the end-to-end distance 
(Liphardt et alJ . I200H) . These traces probe the kinetics 
of structural rearrangements in RNA, and could provide 
even more detailed resolution of intermediate states (if 
the kinetics is slow enough to be resolvable in the experi- 
ment). However, we expect that it is still helpful to make 
the choice for the stiffness of the force-measuring device 
as discussed above, in order to limit the number of states 
that contribute to each of the timetraces. 

Although single-molecule experiments on RNA have 
already produced remarkable results, there are many de- 
sirable future developments. For instance, it appears that 
the current approaches are not well-suited to measure the 
secondary structure of an unknown molecule (e.g., in the 
'cutting approach' one would not know where to cut and 
it would be too laborious to try all positions). Is there a 
direct way to mechanically measure the secondary struc- 
ture? Also, could one detect signatures of pseudoknots 
or even tertiary interactions? Wc hope that theoretical 
models of the type presented here can be useful for the 
planning and design of new experimental approaches in 
the future. 

Note. After completion of this work, we learned of a 
related theoretical study by S. Cocco, J.F. Marko, and 
R. Monasson (preprint, cond-mat/0207609). These au- 
thors present a kinetic model for the end-to-end distance 
fluctuations of stretched RNA molecules, which nicely 
complements our thermodynamic study. Very recently, 
A.F. Sauer-Budge, J. A. Nyamwand a, D.K. Lubensky, 
and D. Branton (preprint cond-mat/0209414) have ex- 



perimentally studied the unzipping kinetics of double- 
stranded DNA with a short internal loop, and demon- 
strated the presence of an intermediate state comparable 
to the one discussed here for the modified P5ab hairpin. 

Acknowledgments. We are grateful to S.M. Block, 
S. Dumont, H. Isambert, J. Liphardt, D.K. Lubensky, I. 
Tinoco, and S.A. Woodson for stimulating discussions. 
U.G. was supported in part by a fellowship from the 
'Deutscher Akademischer Austauschdienst'. R.B. and 
T.H. acknowledge support by the NSF through Grant 
No. DMR-9971456, DBL9970199, and the Beckmann 
foundation. 



References 

Banerjee, A.R., J. A. Jaeger, and D.H. Turner. 1993. Ther- 
mal unfolding of a group I ribozyme: the low-temperature 
transition is primarily disruption of tertiary structure. Bio- 
chemistry 32:153-163. 



10 



Bockelmann, U., B. Essevaz-Roulet, and F. Heslot. 1998. 

DNA strand separation studied by single molecule force 

measurements. Phys. Rev. E 58:2386-2394. 
Bustamante, C, J.F. Marko, E.D. Siggia, and S.B. Smith. 

1994. Entropic elasticity of lambda-phage DNA. Science 

265:1599-1600. 

Cech, T.R. 1993. Structure and mechanism of the large cat- 
alytic RNAs: group I and group II introns and ribonu- 
clease P. In The RNA World (Gesteland, R.F. & Atkins, 
J.F., eds). 239-269, Cold Spring Harbor Laboratory Press, 
Plainview, NY. 

Fernandez, J.M., S. Chu, A.F. Oberhauser. 2001. RNA struc- 
ture: Pulling on Hair(pins). Science 292:653-654. 

Gerland, U., R. Bundschuh, and T. Hwa. 2001. Force-induced 
denaturation of RNA. Biophys. J. 81:1324-1332. 

Heymann, B. and H. Grubmiiller. 2000. Dynamic Force Spec- 
troscopy of Molecular Adhesion Bonds. Phys. Rev. Lett. 
84:6126-6129. 

Hofacker, I.L., W. Fontana, P.F. Stadler, S. Bonhoef- 
fer, M. Tacker, and P. Schuster. 1994. Fast folding 
and comparison of RNA secondary structures. Monat- 
shefte f. Chemie 125:167-188; software available online at 
http : //www . tbi . univie . ac. at/ 

Isambert, H. and E.D. Siggia. 2000. Modeling RNA folding 
paths with pseudoknots: Application to hepatitis delta 
virus ribozyme. Proc. Natl. Acad. Sa. (USA ) 97:6515-6520. 

Jaeger, A.A., M. Zuker, and D.H. Turner. 1990. Melting and 
Chemical Modification of a Cyclized Self-Splicing Group 
I Intron: Similarity of Structures in 1 M Na"'", in 10 
mM Mg'^"'', and in the Presence of Substrate. Biochemistry 
29:10147-10158. 

Liphardt, J., B. Onoa, S.B. Smith, I. Tinoco, and C. Busta- 
mante. 2001. Reversible Unfolding of Single RNA Molecules 
by Mechanical Force. Science 292:733-737. 

Liphardt, J., S. Dumont, S.B. Smith, I. Tinoco, and C. Busta- 
mante. 2002. Equilibrium information from nonequilibrium 
measurements in an experimental test of Jarzynski's equal- 



ity. Science 296:1832-1835. 

Maier, B., D. Bensimon, and V. Croquette. 2000. Replication 
by a single DNA polymerase of a stretched single-stranded 
DNA. Proc. Natl. Acad. Sci. USA 97:12002-12007. 

McCaskill, J.S. 1990. The equilibrium partition function and 
base pair binding probabilities for RNA secondary struc- 
ture. Biopolymers 29:1105-1119. 

Montanari, A. and M. Mezard. 2001. Hairpin formation and 
elongation of biomolecules. Phys. Rev. Lett. 86:2178-2181. 

Pan, J. and S.A. Woodson. 1998. Folding Intermediates of 
a Self-splicing RNA: Mispairing of the Catalytic Core. J. 
Mol. Biol. 280:597-609. 

Rief, M., M. Gautel, F. Oesterheh, J.M. Fernandez, and H.E. 
Gaub. 1997. Reversible unfolding of individual titin im- 
munoglobulin domains by AFM. Science 276:1109-1112. 

SatnaLucia, J. Jr. 1998. A unified view of polymer , dumbbell, 
and oligonucleotide DNA nearest-neighbor thermodynam- 
ics. Proc. Natl. Acad. Sa. USA 95:1460-1465. 

Smith, S.B., Y. Cui, and C. Bustamante. 1996. Overstretching 
B-DNA: the elastic response of individual double-stranded 
and single-stranded DNA molecules. Science 271:795-799. 

Tinoco, I. Jr and C. Bustamante. 1999. How RNA folds. J. 
Mol. Biol. 293:271-281. 

Walter, A.E., D.H. Turner, J. Kim, M.H. Lyttle, P. MuUer, 
D.H. Mathews, and M. Zuker. 1994. Coaxial stacking of 
helixes enhances binding of oligoribonucleotides and im- 
proves predictions of RNA folding. Proc. Natl. Acad. Sci. 
USA 91:9218-9222. 

Zhuang, X., L.E. Bartley, H.P. Babcock, R. Russell, T. Ha, 
D. Herschlag, and S. Chu. 2000. A Single-Molecule Study 
of RNA Catalysis and Folding. Science 288:2048-2051. 

Zhuang, X., H. Kim, M.J.B. Pereira, H.P. Babcock, N.G. Wal- 
ter, and S. Chu. 2002. Correlating Structural Dynamics and 
Function in Single Ribozyme Molecules. Science 296:1473- 
1476. 



